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KERATINOCYTE GROWTH FACTOR ANALOGS 

Field of the Invention 

5 The present invention relates to recombinant 

DNA technology and protein engineering. Specifically, 
recombinant DNA methodologies have been applied to 
generate polypeptide analogs of keratinocyte growth 
factor (KGF) , a potent mitogen of non- fibroblast 
10 epithelial cell growth, wherein the analogs have 

improved stability as compared to that of the parent 
KGF. 

3flgKqrQun<3 

15 

The complex process of tissue generation and 
regeneration is mediated by a number of protein factors 
sometimes referred to as soft tissue growth factors. 
These molecules are generally released by one cell 

20 type and act to influence proliferation of other cell 
types. (Rubin et al. (1989), Proc. Nafl. Acad. Sci. 
USA, &£: 802-806). Some soft tissue growth factors axe 
secreted by particular cell types and influence the 
proliferation, differentiation and/or maturation of 

25 responsive cells in the development of multicellular 
organisms (Finch et al. (1989), Science, 245:752-755). 
In addition to their roles in developing organisms, some 
are significant in the continued health and maintenance 
of more mature systems. For instance, in mammals there 

30 are many systems where rapid cell turnover occurs. Such 
systems include the skin and the gastrointestinal tract, 
both of which are comprised of epithelial cells. 
Included within this group of soft tissue growth factors 
is a protein family of fibroblast growth factors (FGFs) . 

35 There are currently eight known FGF family 

members which share a r elatedness among primary 



W 96/11951 PCT/US95/13075 

- 2 - 



structures: basic fibroblast growth factor, bFGF 
(Abraham et al. (1986), EMBO J. , £:2523-2528) ; acidic 
fibroblast growth factor, aFGF (Jaye et al. (1986), 
Science, 212:541-545); int-2 gene product, int-2 
5 (Dickson & Peters (1987), Nature, 126:833); hst/kFGF 
(Delli-Bovi et al . (1987), Cell, 50:729-737 and 
Yoshida et al. (1987), Proc. Natl. Acad. Sci. USA, 
M:7305-7309) ; FGF-5 (Zhan et al . (1988), Mol. Cell. 
Biol., 8:3487-3495); FGF-6 (Maries et al . (1989), 
10 Oncogene, 4:335-340); keratinocyte growth factor (Finch 
et al. (1989), Science, 21:752-755) and hisactophilin 
(Habazzettl et al. (1992), Nature, 1^2:855-858). 

Among the FGF family of proteins, keratinocyte 
growth factor ( "KGF" ) is a unique effector of non- 
15 fibroblast epithelial (particularly keratinocyte) cell 
proliferation derived from mesenchymal tissues. The 
term "native KGF" refers to a natural human (hKGF) or 
recombinant (rKGF) polypeptide (with or without a signal 
sequence) as depicted by the amino acid sequence 
20 presented in SEQ ID NO: 2 or an allelic variant thereof. 
[Unless otherwise indicated, amino acid numbering for 
molecules described herein shall correspond to that 
presented for the mature form of the native molecule 
(i.e., minus the signal sequence), as depicted by amino 
25 acids 32 to 194 of SEQ ID NO:2.] 

Native KGF may be isolated from natural human 
sources (hKGF) or produced by recombinant DNA techniques 
(rKGF) (Finch et al. (1989), supra; Rubin et al. (1989), 
supra; Ron et al. (1993), The Journal of Biological 
30 Chemistry, 268(4) :2984-2988: and Yan et al. (1991), In 
Vitro Cell, Dev. Biol., 22&:437-438) . 

It is known that native KGF is relatively 
tins table in the aqueous state and that it undergoes 
chemical and physical degradation resulting in a loss of 
35 biological activity during processing and storage (Chen 
et al. (1994), Pharmaceutical Research, 11:1582-1589). 
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Native KGF is prone also to aggregation at elevated 
temperatures and it becomes inactivated under acidic 
conditions (Rubin et al. (1989), Proc. Natl. Acad. Sci. 
USA, £6:802-806) . Aggregation of native KGF in aqueous 
5 solution also results in inactivated protein. This is 
disadvantageous because such loss of activity makes it 
impractical to store aqueous formulations of native KGF 
proteins for extended periods of time or to administer 
the protein over extended periods. Moreover, this is 

10 particularly problematic when preparing pharmaceutical 
formulations, because aggregated proteins have been 
known to be immunogenic (Cleland et al . (1993), Crit. 
Rev. Therapeutic Drug Carrier Systems, 10:307-377; 
Robbins et al. (1987), Diabetes, 16:838-845; and 

15 Pinckard et al. (1967), Clin. Exp. Immunol., 2:331-340). 

Recombinant DNA technology has been utilized 
to modify the sequences of various FGF family members. 
For example, bFGF and aFGF have been modified by 
deleting or substituting positively- charged residues, 

20 which are important for heparin binding with neutral or 
negatively-charged amino acids. It was reported that 
the modified molecules resulted in reduced heparin 
binding activity. Accordingly, it was taught that the 
amount of modified molecule sequestered by heparin 

25 and/or heparin-like molecules in a patient would be 

reduced, thereby increasing potency as more of the FGF 
will reach its targeted receptor (EP 0 298 723) . 

In order to improve or otherwise alter one or 
more of the characteristics of native KGF, protein 

30 engineering may be employed. Ron et al. (1993), J. 
Biol. Chem. , 2££lAl:2984-2988 reported modified KGF 
polypeptides having 3, 8, 27, 38 or 49 amino acids 
deleted from the N- terminus. Those polypeptides missing 
3, 8, or 27 N-t rminal r sidues retained heparin binding 

35 ability; the others did not. Also, the polypeptides 
missing 3 and 8 residues were reported as being fully 
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10 



15 



active, whereas the form missing 27 residues was 10-20 
fold less mitogenic, and the forms lacking 38 or 49 
amino acids did not have mitogenic activity. The 
stability of the modified KGF polypeptides was not 
discussed or otherwise reported. 

Published PCT application no. 90/08771, supra, 
also reported the production of a chimeric protein 
wherein about the first 40 N-terminal amino acids of 
mature form of native KGF were combined with the C- 
terminal portion (about 140 amino acids) of aFGF. . The 
chimera was reported to target keratinocytes like KGF, 
but it lacked susceptibility to heparin, a 
characteristic of aFGF but not KGF. The stability of 
the chimera was not discussed or otherwise reported. 

Thus, the literature has not reported a 
modified KGF molecule having significantly improved 
stability relative to native KGF. Moreover, the 
literature has not reported sufficient teachings or 
evidence to provide a reasonable expectation of 
successfully generating KGF molecules with such 
desirable characteristics . 

It is not currently possible to predict the 
characteristics of a protein based upon the knowledge of 
only its primary structure. For example, the mitogenic 
25 activity of aFGF is substantially increased in the 

presence of heparin, but the mitogenic activity of bFGF 
in the presence of heparin is only minimally increased, 
despite the fact that heparin tightly binds to bFGF 
[(Burgess and Maciag (1989), Annu. Rev. Biochem. , 
30 5fi:575-606; Schreiber, et al. (1985), Proc Natl. Acad. 
Sci. USA, £2:6138-6142; and Gospodarowizc and Cheng 
(1986), J. Cell Physiol., 12fi:475-485) ; and PCT 
90/00418)). m contrast, thymidine incorporation by 
BALB/MK cells is inhibited when heparin is included with 
35 KGF in the culture medium. 



20 
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Generally, the effects upon biological 
activity of any amino acid change upon the protein will 
vary depending upon a number of factors, including the 
three-dimensional structure of the protein and whether 
5 or not the modification is to either the heparin binding 
region or the receptor binding region on the primary 
sequence of the protein. As neither the three- 
dimensional structure nor the heparin binding region and 
the receptor binding region on the primary sequence of 

10 native KGF has been published, the knowledge within the 
art does not permit generalization about the effects of 
amino acid modifications to native KGF based upon the 
effects of amino acid modifications on even commonly 
categorized proteins. 

15 it is the object of this invention to provide 

polypeptide analogs of KGF and nucleic acid molecules 
encoding such analogs that exhibit enhanced stability 
(e.g., when subjected to typical pH, thermal and/or 
other storage conditions) as compared to native KGF. 

20 

Summary of the Invention 

The present invention provides novel, 
biologically active polypeptide analogs of KGF. For 

25 purposes of this invention, the term "KGF" includes 
native KGF and proteins characterized by a peptide 
sequence substantially the same as the peptide sequence 
of native KGF which retain some or all of the biological 
activity of native KGF, particularly non- fibroblast 

30 epithelial cell proliferation. By "characterized by a 
peptide sequence substantially the same as the peptide 
sequence of native KGF" is meant a peptide sequence 
which retains residues corresponding to Arg 41 , Gin 43 , 
Lys 55 - Lys 95 , Asn* 37 , Gin* 3 **, Lys l39 f Arg* 44 , Lys* 47 , 

35 Gin 152 , Lys 153 and Thr 154 of SEQ ID NO: 2 and which is 
encoded by a DNA sequence capable of hybridizing to 
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nucleotides 201 to 684 of SEQ. ID. NO:l, pr ferably 
under stringent hybridization conditions. 

The determination of a corresponding amino 
acid position between two amino acid sequences may be 
5 determined by aligning the two sequences to maximize 
matches of residues including shifting the amino and/or 
carboxyl terminus, introducing gaps as required and/or 
deleting residues present as inserts in the candidate. 
Database searches, sequence analysis and manipulations 
10 may be performed using one of the well-known and 
routinely used sequence homology/ identity scanning 
algorithm programs (e.g., Pearson and Lipman (1988), 
Proc. Natl. Acad. Sci. U.S.A., £5:2444-2448; Altschul et 
al. (1990), J. Mol. Biol., 215:403-410; Lipman and 
15 Pearson (1985), Science, 222:1435 or Devereux et al. 
(1984), Nuc. Acids Res., 12:387-395). 

Stringent conditions, in the hybridization 
context, will be stringent combined conditions of salt, 
temperature, organic solvents and other parameters 
20 typically controlled in hybridization reactions. 
Exemplary stringent hybridization conditions are 
hybridization in 4 X SSC at 62-67° C, followed by 
washing in 0.1 X SSC at 62-67° C. for approximately an 
hour. Alternatively, exemplary stringent hybridization 
25 conditions are hybridization in 45-55% fonnamide, 4 X 
SSC at 40-45°C, [See, T. Maniatis et. al., Molecular 
Cloning (A Laboratory Manual); Cold Spring Harbor 
Laboratory (1982), pages 387 to 389]. 

Thus, the proteins include allelic variations, 
30 or deletion(s), substitution (s) or insertion(s) of amino 
acids, including fragments, chimeric or hybrid molecules 
of native KGF. One example of KGF includes proteins 
having residues corresponding to Cys 1 and Cys 15 of SEQ ID 
NO: 2 replaced or deleted, with the resultant molecule 
35 having improved stability as compared with the parent 
molecule (as taught in commonly owned U.S. S.N. 
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08/487,825, filed on July 7 , 1995). Specifically 
disclosed molecules include: C(1,15)S, a KGF having 
substitutions of serine for cysteine at amino acid 
positions 1 and 15; AN15-AN24, KGFs having a deletion of 
5 any one of from the first 15 to 24 amino acids of the N- 
terminus of native KGF; AN3/C(15)S, a KGF having a 
deletion of the first 3 amino acids of the N-terminus of 
native KGF and a substitution of serine for cysteine at 
amino acid position 15; AN3/C{15)-, a KGF having a 

10 deletion of the first 3 amino acids of the N-terminus of 
native KGF and a deletion of cysteine at amino acid 
position 15; AN8/C(15)S, a KGF having a deletion of the 
first 8 amino acids of the N-terminus of native KGF and 
a substitution of serine for cysteine at amino acid 

15 position 15; AN8/C(15)-, a KGF having a deletion of the 
first 8 amino acids of the N-terminus of native KGF and 
a deletion of cysteine at amino acid position 15; 
0(1,15,40)5, a KGF having a substitution of serine for 
cysteine at amino acid positions 1, 15 and 40; 

20 C<1, 15,1Q2)S, a KGF having a substitution of serine for 
cysteine at amino acid positions 1, 15 and 102; and 
C (1,15, 102, 106) S, a KGF having a substitution of serine 
for cysteine at amino acid positions 1, 15, 102 and 106. 
Another example of KGF includes proteins 

25 generated by substituting at least one amino acid having 
a higher loop- forming potential for at least one amino 
acid within a loop-forming region of Asn 115 -His 1:I -6_ 
Tyr 117 -Asn 118 -Thr 119 of native KGF (as taught in 
commonly owned U.S. S.N. 08/323,473, filed on October 13, 

30 1994), specifically including H(116)G, a KGF having a 
substitution of glycine for histidine at amino acid 
position 116 of native KGF. 

A still further example includes proteins 
having one or more amino acid substitutions, deletions 

35 or additions within a region of 123-133 (amino acids 
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154-164 of SEQ ID N0:2) of native KGF; these proteins 
may have agonistic or antagonistic activity. 

Surprisingly, it has been discovered that by 
deleting or substituting neutral or negatively charged 
5 peptides for the more positively charged residues (i.e., 
substituting negatively charged residues for neutral or 
positively charged residues, or neutral residues for 
positively charged residues) of a KGF molecule (i.e., 
parent molecule) , the resultant KGF analog has improved 

10 stability as compared to the parent molecule. 

Preferably, in addition to having increased stability, 
the invention is directed to those analogs which also 
exhibit full biological activity (i.e., at least 
substantially similar receptor binding or affinity) as 

15 compared to native KGF. 

In another aspect of the invention, purified 
and isolated nucleic acid molecules encoding the various 
biologically active polypeptide analogs of KGF are 
described. In one embodiment, such nucleic acids 

20 comprise DNA molecules cloned into biologically 
functional plasmid or viral vectors. In another 
embodiment, nucleic acid constructs may then be utilized 
to stably transform a procaryotic or eucaryotic host 
cell. In still another embodiment, the invention 

25 involves a process wherein either a procaryotic 

(preferably E. coli) or eucaryotic host cell stably 
transformed with a nucleic acid molecule is grown under 
suitable nutrient conditions in a manner allowing the 
expression of the KGF analog. Following expression, the 

30 resultant recombinant polypeptide can be isolated and 
purified. 

A further aspect of the invention 
concerns pharmaceutical formulations comprising a 
therapeutically effective amount of a KGF analog and an 
35 acceptabl pharmaceutical carrier. Such formulations 
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will be useful in treating patients afflicted with 
epithelial diseases and injuries. 

In this vein, another aspect relates to 
methods of stimulating epithelial cell growth by 
5 administering to a patient a therapeutically effective 
amount of a KGF analog. In one embodiment, non- 
fibroblast epithelial cells are the cells whose 
proliferation is stimulated. Such epithelial cells 
include various adnexal cells, pancreatic cells, liver 
10 cells, and mucosal epithelium in the respiratory and 
gastrointestinal tracts. 

prjef PescrjpUon of the Figures 



15 Figure 1 shows the nucleotide (SEQ ID NO:l) 

and amino acid (SEQ ID NO: 2) sequences of native KGF 
(the nucleotides encoding the mature form of native KGF 
are depicted by bases 201 to 684 of SEQ ID NO:l and the 
mature form of KGF is depicted by amino acid residues 32 

20 to 194 of SEQ ID N0:2) . 

Figures 2A, 2B and 2C show the plasmid maps of 
PCFM1156, pCFM1656 and pCFM3102, respectively. 

Figure 3 shows the nucleotide (SEQ ID NO: 3) 
and amino acid (SEQ ID NO: 4) sequences of the construct 

25 RSH-KGF. 

Figure 4 shows the nucleotide (SEQ ID NO: 5) 
and amino acid (SEQ ID NO: 6) sequences of the construct 
contained in plasmid KGF. 

Figure 5 shows the chemically synthesized 

30 OLIGOs <OLIGO#6 through OLIGO#ll; SEQ ID NO: 12-17, 
respectively) used to substitute the DNA sequence 
between a Kpnl site and an EcoRI site (from amino acid 
positions 46 to 85 of SEQ ID No: 6) in the construct 
contained in plasmid KGF to produce the construct in 

35 plasmid KGF(dsd) . 
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Figure 6 shows the chemically synthesized 
OLIGOs (0LIG0#12 through OLIGO#24; SEQ ID NO: 18-30, 
respectively) used to construct KGF (codon optimized) . 

Figure 7 shows the nucleotide (SEQ ID NO: 31) 
5 and amino acid sequences (SEQ ID NO:32) of R(144)Q, a 
KGF analog having a substitution of glutamine for 
arginine at amino acid position 144 of native KGF. 

Figure 8 shows the nucleotide (SEQ ID NO: 33) 
and amino acid sequences (SEQ ID NO: 34) of 

10 C(l, 15) S/R(144)E, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
a substitution of glutamic acid for arginine at amino 
acid position 144 of native KGF. 

Figure 9 shows the nucleotide (SEQ ID NO: 35) 

15 and amino acid (SEQ ID NO: 36) sequences of 

C(l, 15)S/R(144)Q, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
a substitution of glutamine for arginine at amino acid 
position 144 of native KGF. 

20 Figure 10 shows the nucleotide (SEQ ID NO: 37) 

and amino acid (SEQ ID NO: 38) sequences of AN23/R(144)Q, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamine 
for arginine at amino acid position 144 of native KGF. 

25 Figure 11 shows the amount of soluble protein, 

determined by size exclusion HPLC, as a function of 
incubation time at 37°C. 

Figure 12 shows the estimated melting 
temperature (T m j as a function of pH for native KGF, 

30 C(1,15)S/R(144)Q and C (1, 15) S/R(144) E. 

Figure 13 shows a typical profile of mitogenic 
activity of R(144)Q, determined by measuring the 
incorporation of [ 3 HJ -Thymidine during DNA synthesis and 
by comparing it to a native KGF standard curve. 

35 Figure 14 shows a typical profile of the 

mitogenic activity of AN23/R{144)Q, determined by 
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measuring the incorporation of [ 3 H] -Thymidine during DNA 
synthesis and by comparing it to a native KGF standard 
curve. 

Figure 15 shows a typical profile of the 
5 mitogenic activity of C(l, 15) S/R(144)Q, determined by 
measuring the incorporation of [ 3 H] -Thymidine during DNA 
synthesis and by comparing it to a native KGF standard 
curve. 

Figure 16 shows a typical profile of the 
10 mitogenic activity of C (1, 15) S/R(144) E, determined by 

measuring the incorporation of [ 3 H] -Thymidine during DNA 
synthesis and by comparing it to a native KGF standard 
curve. 

Figure 17 shows the nucleotide (SEQ ID NO: 41) 
15 and amino acid (SEQ ID NO: 42) sequences of AN23/N(137)E, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamic 
acid for asparagine at amino acid position 137 of native 
KGF. 

20 Figure 18 shows the nucleotide (SEQ ID NO: 43) 

and amino acid (SEQ ID NO: 44) sequences of AN23/K(139)E, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamic 
acid for lysine at amino acid position 139 of native 

25 KGF. 

Figure 19 shows the nucleotide (SEQ ID NO: 45) * 
and amino acid (SEQ ID NO: 46) sequences of AN23/K(139)Q / 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamine 
30 for lysine at amino acid position 139 of native KGF. 

Figure 20 shows the nucleotide (SEQ ID NO: 47) 
and amino acid (SEQ ID NO: 48) sequences of AN23/R (144) A, 
a KGF analog having a del tion of the first 23 amino 
acids of the N-terminus and a substitution of alanine 
35 for arginine at amino acid position 144 of native KGF. 

Figure 21 shows the nucleotide (SEQ ID NO: 49) 
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and amino acid (SEQ ID NO: 50) sequences of AN23/R (144)L, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of leucine 
for arginine at amino acid position 144 of native KGF. 
5 Figure 22 shows the nucleotide (SEQ ID NO: 51) 

and amino acid (SEQ ID NO: 52) sequences of AN23/K(147)E, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamic 
acid for lysine at amino acid position 147 of native 
10 KGF. 

Figure 23 shows the nucleotide (SEQ ID NO: 53) 
and amino acid (SEQ ID NO:54) sequences of AN23/K(147)Q, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamine 

15 for lysine at amino acid position 147 of native KGF. 

Figure 24 shows the nucleotide (SEQ ID NO: 55) 
and amino acid (SEQ ID NO: 56) sequences of AN23VK(153)E, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamic 

20 acid for lysine at amino acid position 153 of native 
KGF. 

Figure 25 shows the nucleotide (SEQ ID NO: 57) 
and amino jicid (SEQ ID NO:58) sequences of £N23/K(153)Q, 
a KGF analog having a deletion of the first 23 amino 

25 acids of the N-terminus and a substitution of glutamine 
for lysine at amino acid position 153 of native KGF. 

Figure 26 shows the nucleotide (SEQ ID NO: 59) 
and amino acid (SEQ ID NO: 60) sequences of 
AN23/Q(152)E/K(153)E, a KGF analog having a deletion of 

30 the first 23 amino acids of the N-terminus and a 

substitution of glutamic acid for glutamine at amino 
acid position 152 of native KGF and glutamic acid for 
lysine at amino acid position 153 of native KGF. 
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Detailed Description 

In accordance with the present invention, 
novel analogs of KGF are provided. The KGF analogs are 
5 produced by deleting or substituting one or more 
specific, positively-charged residues in KGF. 

The KGF analogs have, among other properties, 
an improved stability under at least one of a variety of 
purification and/or storage conditions. For example, 

10 the KGF analogs will generally be purified in a greater 
yield of soluble, correctly folded protein. Moreover, 
once the material is purified, it will be more stable to 
pH, temperature, etc. as compared to the stability of 
the parent molecule. As described in the Examples 

15 section below (modified by the substitution of Gin and 
Glu for arginine at position 144 [R(144)Q and R(144)E, 
respectively] and in some instances modified at the N- 
terminus as well) exhibit, relative to native KGF, (1) a 
35 to 37.2 day increase of half -life upon storage at 

20 37°C, (2) a 7.5-9.5% higher thermal melting temperatures 
over the course of thermal unfolding, and (3) an 
increase in T m over a range of pH values. 

Although not intended to be bound by theory, a 
possible reason for the enhanced stability of the 

25 R{144)Q and R(144)E may be due to a reduction in overall 
charge density of a cluster of basic residues, which is 
inherently unstable due to charge repulsion, in the 
absence of heparin. The results set forth below suggest 
that the arginine residue at position 144 may correspond 

30 to a residue in bFGF, as determined by X-ray 

crystallography, which is reported to be within or near 
a cluster of basic residues that mediate heparin binding 
(Ago, et al. (1991), J". Biochem. , H£:360-363 ; and 
Erikss n et al. (1993), Protein Science, 2:1274-1284). 

35 Native KGF contains 46 charged residues, 27 of 

which carry a positive charge. In view of the results 
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10 



obtained with the KGF analogs, a comparison of the 
native KGF primary sequence with the primary sequence of 
bFGF suggests that some of the 27 positively charged 
residues form a cluster similar to a cluster found in 
the tertiary structure of bFGF. Depending on the 
location of such residues in the protein's three- 
dimensional structure, substitution of one or more of 
these clustered residues with amino acids carrying a 
negative or neutral charge may alter the electrostatic 
interactions of adjacent residues and may be useful to 
achieve increased stability. 

Thus other analogs, in addition to the 
preferred R(144)Q specifically set forth herein, are 
contemplated by the present invention. As used in this 
15 invention, a "KGF analog" or a "polypeptide analog of 
KGF" shall mean charge-change polypeptides wherein one 
or more of amino acid residues 41-154 (amino acids 72- 
185 of SEQ ID N0:2), specifically including amino acid 
residues 123-133 (amino acids 154-164 of SEQ ID N0:2) , 
are deleted or substituted with a neutral residue or 
negatively charged residue selected to effect a protein 
with a reduced positive charge. Preferred residues for 
modification are Arg«, Gln«3, L y S 55, Lys^, L y S i28 ( 

W37. Gln 138, Lys 13 9/ ^144, Lys l 47/ Gln 152, Lys 153 or 
25 Thrl54, with Glnl38, Lys 139 # ^144, Lys 147 # Gln l 52 Qr 

Lysi53 being more preferred ^ ^144 being most 
preferred. Preferred amino acids for substitution 
include glutamic acid, aspartic acid, glutamine, 
asparagine, glycine, alanine, valine, leucine, 
isoleucine, serine and threonine, with glutamic acid, 
glutamine, aspartic acid, asparagine and with alanine 
being particularly preferred. 

Any modification should give consideration to 
minimizing charge repulsion in the tertiary structure of 
the molecule; most preferably the analog will have 
incr ased stability compared with the parent mol cule. 



20 



30 



35 
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Obviously, the deletions or substitutions should not be 
so numerous nor be made to residues of such close 
proximity so as to set up charge repulsion between two 
negatively- charged residues. 
5 When the KGF analogs are biologically 

generated, i.e., are the products of cellular expression 
as opposed to the products of solid state synthesis, 
proteolytic or enzymatic derivatization of naturally- 
occurring products, etc., the nucleic acids encoding 

10 such polypeptides will differ in one or more nucleotides 
as compared to the native KGF nucleotide sequence. Such 
nucleotides may be expressed and the resultant 
polypeptide purified by any one of a number of 
recombinant technology methods known to those skilled in 

15 the art. 

DNA sequences coding for all or part of the 
KGF analogs may include, among other things, the 
incorporation of codons "preferred" for expression in 
selected host cells (e.g., "£. coli expression codons"); 

20 the provision of sites for cleavage by restriction 
enzymes; and the provision of additional initial, 
terminal, or intermediate nucleotide sequences (e.g., as 
an initial methionine amino acid residue for expression 
in E. coli cells), to facilitate construction of readily 

25 expressed vectors. 

The present invention also provides 
recombinant molecules or vectors for use in the method 
of expression of the polypeptides. Such vectors may be 
comprised of DNA or RNA and can be circular, linear, 

30 single-stranded or double-stranded in nature and can be 
naturally-occurring or assemblages of a variety of 
components, be they naturally-occurring or synthetic. 

Many examples of such expression vectors axe 
known. The components of the v ctors, e.g. replicons, 

35 selection genes, enhancers, promoters, and th like, may 
be obtained from natural sources or synthesized by known 
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procedures. In each case, expression vectors useful in 
this invention will contain at least one expression 
control element functionally associated with the 
inserted nucleic acid molecule encoding the KGF 
5 polypeptide analog. This control element is responsible 
for regulating polypeptide expression from the nucleic 
acid molecules of the invention. Useful control 
elements include, for example, the lac system, the trp 
system, the operators and promoters from phage X, a 
10 glycolytic yeast promoter, a promoter from the yeast 

acid phosphatase gene, a yeast alpha-mating factor, and 
promoters derived from adenovirus, Epstein-Barr virus, 
polyoma, and simian virus, as well as those from various 
retroviruses. However, numerous other vectors and 
15 control elements suitable for procaryotic or eucaryotic 
expression are known in the art and may be employed in 
the practice of this invention. 

Examples of suitable procaryotic cloning 
vectors may include plasmids from E. coli (e.g. pBR322, 
20 col El, pUC, and the F-f actor) , with preferred plasmids 
being pCFM1156 (ATCC 69702), pCFM1656 (ATCC 69576) and 
PCFM3102 (described in the Examples section, below) . 
Other appropriate expression vectors of which numerous 
types are known in the art for mammalian, insect, yeast, 
25 fungal and bacterial expression can also be used for 
this purpose. The transfection of these vectors into 
appropriate host cells can result in expression of the 
KGF analog polypeptides. 

Host microorganisms useful in this invention 
30 may be either procaryotic or eucaryotic. Suitable 

procaryotic hosts include various E. coli (e.g., FM5, 
HB101, DH5CCDH10, and MC1061) , Pseudomonas, Bacillus, and 
Streptomyces strains, with E. coli being preferred. 
Suitable eucaryotic host cells include yeast and other 
35 fungi, insect cells, plant cells, and animal cells, such 
as COS (e.g., COS-1 and COS-7) and CV-1 monkey cell 
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lines, 3T3 lines derived from Swiss, Balb-c or NIH 
cells, HeLa and L-929 mouse cells, and CHO, BHK or HaK 
hamster cells. Depending upon the host employed, 
recombinant polypeptides produced in accordance herewith 
5 will be glycosylated with mammalian or other eucaryotic 
carbohydrates or may be non-glycosylated. 

The preferred production method will vary 
depending upon many factors and considerations; the 
optimum production procedure for a given situation will 

10 be apparent to those skilled in the art through minimal 
experimentation. The resulting expression product may 
then be purified to near homogeneity using procedures 
known in the art. A typical purification procedure for 
procaryotic cell production involves rupturing the cell 

15 walls by high pressure or other means, centrifugation or 
filtration to remove cellular debris., followed by ion 
exchange chromatography of supernatant or filtrate and, 
finally, hydrophobic interaction chromatography. If the 
analog is expressed in insoluble form, another 

20 purification technique involves first solublizing the 

inclusion bodies containing the analogs followed by ion 
exchange chromatography, then refolding of the protein, 
and, finally, hydrophobic interaction chromatography. 
Exemplary purification techniques are taught in commonly 

25 owned U.S. S.N. 08/323,339, filed on October 13, 1994. 
Generally, U.S. S.N. 08/323,339 teaches a method for 
purifying a keratinocyte growth factor comprising: (a) 
obtaining a solution comprising the KGF; (b) binding the 
KGF from the solution of part (a) to a cation exchange 

30 resin; (c) eluting the KGF in an eluate solution from 

the cation exchange resin; (d) either passing the eluate 
solution from part (c) through an appropriate molecular 
weight exclusion matrix or performing hydrophobic 
interaction chromatography on the eluate solution of 

35 part (c) ; and (e) r covering the KGF from the molecular 
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weight exclusion matrix or hydrophobic interaction 
chromatography . 

Of course, the analogs may be rapidly screened 
to assess their physical properties. The Examples sets 
5 forth various well-known stability assays, although the 
specific assay used to test the analog is not critical. 
Moreover, the level of biological activity (e.g., 
receptor binding and/or affinity, mitogenic, cell 
proliferative and/or in vivo activity) may also be 
10 tested using a variety of assays, some of which are set 
forth in the Examples section. Numerous assays are 
well-known and can be used to quickly screen the KGF 
analogs to determine whether or not they possess 
acceptable biological activity. One such assay 
15 specifically tests the KGF analogs for the ability to 
bind to the KGF receptor (KGFR) by competing with 
125 I-KGF binding (Bottaro et al. (1990), J. Biol. Chem. , 
2^:12767-12770; Ron et al. (1993), J. Biol. Chem., 
2£fl: 2984-2988) . An alternative method for assaying 
20 KGFR /KGF analog interactions involves the use of 

techniques such as real time biospecific interaction 
analysis (BIA) (Felder et al. (1993), Molecular & 
Cellular Biology, 11:1449-1455). Additionally a 
mitogenic assay can be utilized to test the ability of 
25 the KGF analogs to stimulate DNA synthesis (Rubin et al. 
(1989), supra). Finally, cell proliferative assays can 
be utilized to test the ability of the KGF analogs to 
stimulate cell proliferation (Falco, et al. (1988), 
Oncogrene, 2:573-578). Using any of the aforementioned 
30 assay systems, KGF analogs can be rapidly screened for 
their biological activity. 

The KGF analogs may be further modified to 
contain additional chemical moieties not normally a part 
of the peptide. Such derivatized moieties may improve 
35 the solubility, absorption, biological half life, and 
the like of the KGF analog. The moieties may 
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alternatively eliminate or attenuate any undesirable 
side effects of the protein and the like. Moieties 
capable of mediating such effects are disclosed, for 
exairple, in REMINGTON'S PHARMACEUTICAL SCIENCES, 18th 
5 ed., Mack Publishing Co., Easton, PA (1990). Covalent 
modifications may be introduced into the molecule by 
reacting targeted amino acid residues of the peptide 
with an organic derivatizing agent that is capable of 
reacting with selected side chains or terminal residues 

10 (T.E. Creighton (1983), PROTEINS: STRUCTURE AND MOLECULE 
PROPERTIES, W.H. Freeman & Co., San Francisco, 
pp. 79-86). Polyethylene glycol ("PEG") is one such 
chemical moiety which has been used in the preparation 
of therapeutic protein products. For some proteins, the 

15 attachment of polyethylene glycol has been shown to 
protect against proteolysis, Sada, et al. (1991) , J. 
Fermentation Bioengineering, 71:137-139 / and methods 
for attachment of certain polyethylene glycol moieties 
are available . £££ U.S. Patent No. 4,179,337, Davis et 

20 al., "Non-Immunogenic Polypeptides," issued December 18, 
1979; and U.S. Patent No. 4,002,531, Royer, "Modifying 
enzymes with Polyethylene Glycol and Product Produced 
Thereby," issued January 11, 1977. For a review, see 
Abuchowski et al. # in Enzymes as Drugs. (Holcerberg and 

25 Roberts, (eds.) pp. 367-383 (1981)). For polyethylene 
glycol, a variety of means have been used to attach the 
polyethylene glycol molecules to the protein. 
Generally, polyethylene glycol molecules are connected 
to the protein via a reactive group found on the 

30 protein. Amino groups, such as those on lysine residues 
or at the N- terminus, are convenient for such 
attachment. For example, Royer (U.S. Pat. 
No. 4,002,531, above) states that reductive alkylation 
was used for attachment of poly thylene glycol molecules 

35 to an enzyme. EP 0 539 167, published April 28, 1993, 
Wright, "Peg Imidates and Protein Derivates Thereof 
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states that peptides and organic compounds with free 
amino group(s) are modified with an imidate derivative 
of PEG or related water-soluble organic polymers. U.S. 
Patent No. 4,904,584, Shaw, issued February 27, 1990, 
5 relates to the modification of the number of lysine 

residues in proteins for the attachment of polyethylene 
glycol molecules via reactive amine groups. 

In yet another embodiment, the present 
invention is directed to a single-dose administration 
10 unit of a medicinal formulation which can be safely 

administered parenterally or orally to treat a disease 
in a warm-blooded animal (such as a human) . Such 
medicinal formulation may be in the form of a 
lyophilized or otherwise dehydrated therapeutic or 
15 diagnostic which can be reconstituted by the addition of 
a physiologically acceptable solvent. The solvent may 
be any media such as sterile water, physiological saline 
solution, glucose solution or other aqueous 
carbohydrates (e.g., polyols such as mannitol, xylitol, 
20 glycerol) which is capable of dissolving the dried 
composition, is compatible with the selected 
administration route and which does not negatively 
interfere with the active principle and the 
reconstitution stabilizers employed. In a specific 
25 embodiment, the present invention is directed to a kit 
for producing the single-dose administration unit. The 
kit contains both a first container having a dried 
protein and a second container having an aqueous 
formulation comprising a reconstitution stabilizer. As 
30 for the concentration of the protein in the solution, 
the solution volume which is charged into each 
container, and the capacity of the containers 
(interrelated parameters which can be suitably modified, 
depending upon the desired concentration of active 
35 principl in the end-dosage unit) , these may vary within 
wide ranges w 11-known to skilled artisans. 
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KGF analogs according to the invention may be 
useful as therapeutic and diagnostic agents and as 
research reagents. Thus the KGF analogs may be used in 
in vitro and/or in vivo diagnostic assays to quantify 
5 the amount of KGF in a tissue or organ sample or to 
determine and/or isolate cells which express KGFR 
(Bottaro et al. (1990), J. Biol. Chem. , 2£5: 12767-12770; 
Ron et al. (1993), J. Biol. Chem., 2£fi:2984-2988) . in 
assays of tissues or organs there will be less 

10 radioactivity from 12 5 1 -KGF analog binding to KGFR, as 
compared to a standardized binding curve of 125 I-KGF 
analog, due to unlabeled native KGF binding to KGFR. 
Similary, the use of 125 I-KGF analog may be used to 
detect the presence of KGFR in various cell types. 

15 This invention also contenplates the use of a 

KGF analog in the generation of antibodies made against 
the peptide, which antibodies also bind to native KGF. 
In this embodiment, the antibodies are monoclonal or 
polyclonal in origin and are generated using a KGF 

20 analog. The resulting antibodies bind preferentially to 
native KGF, preferably when that protein is in its 
native (biologically active) conformation. These 
antibodies can be used for detection or purification of 
the KGF. 

25 Moreover, the invention contemplates the use 

of KGF analogs in the discovery of high affinity or low 
affinity KGF binding molecules having therapeutical 
applications, for example, as a way for efficient KGF 
delivery or as an inhibitor for KGF activity. The 

30 thermal stability of the KGF analogs is important to 
identify such binding molecules in physiological 
conditions (i.e., at 37*C) since their affinity for KGF 
could be strongly temperature-dependent and may be 
unpredictable from the affinity observed at 4*C. 

35 For in vivo us s, the KGF analogs may be 

formulated with additives. Such additives include 



96/11951 



- 22 - 



PCT/US95/13075 



buffers, carriers, stabilizers, excipients, 
preservatives, tonicity adjusting agents, anti-oxidants 
and the like (e.g., viscosity adjusting agents or 
extenders) . The selection of specific additives will 
depend upon the storage form (i.e., liquid or 
lyophilized) and the modes of administering the KGF 
analog. Suitable formulations, known in the art, can be 
found in REMINGTON'S PHARMACEUTICAL SCIENCES (latest 
edition) , Mack Publishing Company, Easton, PA, 

The KGF analogs may be applied in 
therapeutically effective amounts to tissues 
specifically characterized by having damage to or 
clinically insufficient numbers of non- fibroblast 
epithelium cells. Since KGF binds to heparin, it is 
likely that heparin, heparin sulfate, heparin-like 
glycosaminglycans and heparin-like glycosaminoglycans , 
which are present in the extracellular environment may 
bind KGF in vivo. It follows that KGF analogs with 
reduced heparin binding ability will have enhanced 
potency, as more KGF will reach its targeted receptor 
and will not be sequestered by heparin and heparin-like 
compounds in the extracellular environment. These 
analogs will be more useful therapeutically, as lower 
dosages of a particular KGF analog will be required per 
treatment . 

The KGF analogs may be applied in 
therapeutically effective amounts to tissues 
specifically characterized by having damage to or 
clinically insufficient numbers of non-f ibroblast 
epithelium cells. Areas in which KGF analogs may be 
successfully administered include, but are not limited 
to: the stimulation, proliferation and differentiation 
of adnexal structures such as hair follicles, sweat 
glands, and sebaceous glands in patients with burns and 
other partial and full- thickness injuries; accelerat d 
r epithelialization of lesions caused by epidermolysis 
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bullosa, which is a defect in adherence of the epidermis 
to the under lying dermis, resulting in frequent open, 
painful blisters which can cause severe morbidity; 
preventing chemotherapy- induced alopecia and treating 
5 male-pattern baldness, or the progressive loss of hair 
in men and women; treating gastric and duodenal ulcers; 
treating inflammatory bowel diseases, such a Crohn's 
disease (affecting primarily the small intestine) and 
ulcerative colitis (affecting primarily the large 

10 bowel) ; preventing or reducing gut toxicity in radiation 
and chemotherapy treatment regimes through treatment 
(e.g., pretreatment and/or postreatment) to induce a 
cytoprotective effect or regeneration or both; 
stimulating the production of mucus throughout the 

15 gastrointestinal tract; inducing the proliferation and 
differentiation of type II pneumocytes, which may help 
treat or prevent diseases such as hyaline membrane 
disease (i.e., infant respiratory distress syndrome and 
bronchopulmonary dysplasia) in premature infants; 

20 stimulating the proliferation and differentiation of the 
bronchiolar and/or alveolar epithelium with acute or 
chronic lung damage or insufficiency due to inhalation 
injuries (including high oxygen levels), emphysema, use 
of lung damaging chemo therapeutics, ventilator trauma or 

25 other lung damaging circumstances; increasing liver 
function to treat or prevent hepatic cirrhosis, 
fulminant liver failure, damage caused by acute viral 
hepatitis and/or toxic insults to the liver; inducing 
corneal cell regeneration, for example in the treatment 

30 of corneal abrasion; inducing epithelial cell 

regeneration to treat progressive gum disease; inducing 
regeneration of tympanic epithelial cells to treat ear 
drum damage and treating or preventing the onset of 
diabetes mellitus or as an adjunct in the setting of 

35 islet c 11 transplantation. 
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A patient in need of proliferation of non- 
fibroblast epithelial cells will be administered an 
effective amount of a KGF analog. An "effective amount" 
is that amount of KGF analog required to elicit the 
5 desired response in the patient being treated and will, 
thus, generally be determined by the attending 
physician. Factors influencing the amount of KGF analog 
administered will include the age and general condition 
of the patient, the disease being treated, etc. Typical 

10 dosages will range from 0*001 mg/kg body weight to 500 
mg/kg body weight. 

The KGF analog may be safely administered 
parenterally (e.g., via IV, IT, IM, SC, or IP routes), 
orally or topically to warm-blooded animals (such as 

15 humans) . The KGF analog may be used once or 

administered repeatedly, depending on the disease and 
condition of the patient. In some cases, the KGF analog 
may be administered as an adjunct to other therapy and 
also with other pharmaceutical preparations. 

20 The following examples are included to more 

fully illustrate the present invention. It is 
understood that modifications can be made in the 
procedures set forth, without departing from the spirit 
of the invention. 



25 



EXAMPLES 



Standard methods for many of the procedures 
described in the following examples, or suitable 

30 alternative procedures, are provided in widely 

recognized manuals of molecular biology such as, for 
example, Molecular Cloning, Second Edition, Sambrook et 
al., Cold Spring Harbor Laboratory Press (1987) and 
Current Protocols in Molecular Biology, Ausabel et al. , 

35 Greene Publishing Associates /Wiley Interscience, New 
York (1990) . 
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EXAMPLE 1: Preparation of DNA Coding for KGF and KGF Analogs 

The cloning of the full-length human KGF gene 
5 (encoding a polypeptide with the sequence of native KGF) 
was carried out both by polymerase chain reaction (PCR) 
of RNA from an animal cell and by PCR of chemically 
synthesized (E. coli optimized codon) oligonucleotides 
( "OLIGOs" ) - Both procedures are described below: 

10 PCR amplification using RNA isolated from 

cells known to produce the polypeptide was performed. 
Initially, cells from a human fibroblast cell line 
AG1523A (obtained from Human Genetic Mutant Cell Culture 
Repository Institute For Medical Research, Camden, New 

15 Jersey) were disrupted with guanidium thiocyanate, 
followed by extraction (according to the method of 
Chomyzinski et al. (1987), Anal. Biochem., 172:156) . 
Using a standard reverse transcriptase protocol for 
total RNA, the KGF cDNA was generated. PCR (PCR#1) 

20 amplification of the KGF gene was carried out using the 
KGF cDNA as template and primers OLIGO#l and OLIGO#2 
that encode DNA sequences immediately 5 ' and 3 ' of the 
KGF gene [Model 9600 thermocycler (Perkin-Elmer Cetus, 
Norwalk, CT) ; 28 cycles; each cycle consisting of one 

25 minute at 94°C for denaturation, two minutes at 60°C for 
annealing, and three minutes at 72°C for elongation] . A 
small aliquot of the PCR#1 product was then used as 
template for a second KGF PCR (PCR#2) amplification 
identical to the cycle conditions described above except 

30 for a 50°C annealing temperature . For expression 

cloning of the KGF gene, nested PCR primers were used to 
create convenient restriction sites at both ends of the 
KGF gene. 0LIG0#3 and OLIGO#4 were used to modify the 
KGF DNA product from PCR#2 to include Mlul and BaxnHI 

35 restriction sites at the 5* and 3' ends of the gene, 

respectively [PCR#3; 30 cycles; each cycle consisting of 
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one minute at 94°C for denaturation, two minutes at 60°C 
for annealing, and three minutes at 72°C for 
elongation] . This DNA was subsequently cut with Mlul 
and BamHl, phenol extracted, and ethanol precipitated. 
5 It was then resuspended and ligated (using T4 ligase) 
into a pCFM1156 plasmid (Figure 2A) that contained a 
"RSH M signal sequence to make construct RSH-KGF 
(Figure 3) . 

The ligation products were transformed 
10 (according to the method of Hanahan (1983), J. Mol. 

Biol., 1£6:557) into E* coli strain FM5 (ATCC: 53911) 
and plated onto LB+kanamycin at 28°C. Several 
transformants were selected and grown in small liquid 
cultures containing 20 jig/mL kanamycin. The RSH-KGF 
15 plasmid was isolated from the cells of each culture and 
DNA sequenced. Because of an internal Ndel site in the 
KGF gene, it was not possible to directly clone the 
native gene sequence into the desired expression vector 
with the bracketed restriction sites of Ndel and BamHI. 
20 This was accomplished as a three-way ligation. Plasmid 
RSH-KGF was cut with the unique restriction sites of 
BsmJ and SstI, and a -3 kbp DNA fragment (containing the 
3 1 end of the KGF gene) was isolated following 
electrophoresis through a 1% agarose gel. A PCR (PCR#4) 
25 was carried out as described for PCR#3 except for the 
substitution of 0LIG0#5 for OLIGO#3. The PCR DNA 
product was then cut with Ndel and BsmI and a 311 bp DNA 
fragment was isolated following electrophoresis through 
a 4% agarose gel. The third piece of the ligation is a 
30 1.8 kbp DNA fragment of pCEM1156 cut with Ndel and SstJ 
which was isolated following electrophoresis through a 
1% agarose gel. Following ligation (T4 ligase), 
transformation, kanamycin selection and DNA sequencing 
as described above, a clone was picked containing the 
35 construct in Figure 4 and the plasmid d signated KGF. 
Because of an internal ribosomal binding site that 



WO 96/11951 



- 27 - 



PCT/US95713075 



produced truncated products, the KGF DNA sequence 
between the unique Kpnl and EcoRI sites was replaced 
with chemically synthesized OLIGOs (0LIG0#6 through 
0LIG0#11) to minimize the use of the internal start site 
5 (Figure 5) . 

OLIGO# 1 ( SEQ ID NO: 7) : 5 * -CAATGACCTAGGAGTAACAATCAAC - 3 1 

OLIGO# 2 ( SEQ ID NO : 8 ) : 5 ' - AAAACAAACATAAATGCACAAGTCCA- 3 ' 

OLIGO# 3 { SEQ ID NO: 9 ) : 5 ' -ACAACGCGTGCAATGACATGACTCCA- 3 ' 

10 OLIGO#4 (SEQ ID NO:10): 

5 ' -ACAGGATCCTATTAAGTTATTGCCATAGGAA-3 ' 
OLIGO#5 (SEQ ID NO: 11) : 

5 • - AC AC ATATGTGC AATGACATG ACTCCA - 3 * 
OLIGO#6 (SEQ ID NO: 12): 
15 5 ' -CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC- 3 * 

OLIGOS7 (SEQ ID NO:13): 

5 ' -AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTACTGTT- 3 ' 
OLIGO#8 (SEQ ID NO: 14) : 

5 * -GCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG-3 ■ 
20 OLIGO#9 (SEQ ID NO: 15) : 

5 ' - TCTTGGGTGCCCTTGACTTTGCCGCGTTTGTCGATACGCAGGTAC- 3 1 
OLIGO#10 (SEQ ID NO: 16): 

5 ' -ACAGCAACAGTACGGATTTCCATAATATTGTAGTTGTTTTTCATC-3 ' 
OLIGO#ll (SEQ ID NO:17): 
25 5 • - AATTCAGATTCAACACCTTTGATTGCAACGATACCA- 3 ' 

The OLIGOs were phosphorylated with T4 
polynucleotide kinase and then heat denatured. The 
single-stranded (ss) OLIGOs were then allowed to form a 

30 ds DNA fragment by allowing the temperature to slowly 
decrease to room temperature. T4 ligase was then used 
to covalently link both the internal OLIGO sticky-ends 
and the whole ds OLIGO fragment to the KGF plasmid cut 
with Kpnl and EcoRI. The new plasmid was designated 

35 KGF(dsd). 
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A completely E. coli codon-optimized KGF gene 
was constructed by PCR amplification of chemically 
synthesized OLIGOs #12 through 24. 

5 OLIGOtt 12 ( SEQ ID NO : 1 8 ) : 5 * -AGTTTTGATCTAGAAGGAGG- 3 ' 

OLIGO# 13 ( S EQ I D NO : 1 9 ) : 5 ' - TC AAAACTGGATCCTATTAA- 3 ' 
OLIG0#14 (SEQ ID NO:20) : 

5 1 - AGTTTTGATCTAGAAGGAGG AATAACATATGTGCAACGACATG- 
ACTCCGGAACAGATGGCTACCAACGTTAACTGCTCCAGCCCGGAACGT- 3 ' 
10 OLIG0#15 (SEQ ID NO:21) : 

5 ' - C AC ACCCGTAGCTACG ACTACATGGAAGGTGGTGAC ATCCGT - 
GTTCGTCGTCTGTTCTGCCGTACCCAGTGGTACCTGCGTATCGACAAA- 3 • 
OLIGO#16 (SEQ ID NO:22) : 

5 1 - CGTGGTAAAGTTAAAGGTACCC AGGAAATGAAAAACAACTAC AACATC - 
1 5 ATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAA- 3 ' 

0LIG0#17 (SEQ ID NO:23) : 

5 ' -GGTGTTGAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACT- 
GTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAA- 3 ' 
OLIGO#18 (SEQ ID N0:24> : 

20 5 1 -CTGATCCTGGAAAACCACTACAACACCTACGCATCTGCTAAATGGAC- 

CCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGT- 3 ' 
OLIGO#19 (SEQ ID NO:25) : 

5 ' - ATCCCGGTTCGTGGTAAAAAAACC AAAAAAGAACAGAAAACCGCTC - 
ACTTCCTGCCGATGGC AATC ACTTAAT AGGATCCAGTTTTGA - 3 ' 
25 OLIGO#20 ( SEQ ID NO : 26 ) : 5 * -TACGGGTGTGACGTTCCGGG~3 * 

OLIGO#2 1 ( SEQ ID NO: 27) ; 5 ' - CTTTACCACGTTTGTCGATA- 3 • 
OLIGO#22 { SEQ ID NO: 28) : 5 1 - ATTCAACACCTTTGATTGCA- 3 • 
OLIGO#23 (SEQ ID NO: 29) : 5 • -CCAGGATCAGTTCTTTGAAG-3 • 
0LIG0#24 ( SEQ ID NO : 3 0 ) : 5 • -GAACCGGGATACCTTTCTGG- 3 • 

30 

OLIGOs #12 through 24 were designed so that 
the entire DNA sequence encoding native KGF was 
represented by OLIGOs from either the "Watson" or the 
•Crick- strand and upon PCR amplif ication would produce 
35 the desired double-stranded DNA sequence (Figure 6) 

[PCR#5, Model 9600 thermocycler , Perkin-Elmer Cetus] ; 21 
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cycles, each cycle consisting of 31 seconds at 94°C for 
denaturation, 31 seconds at 50°C for annealing, and 31 
seconds at 73°C for elongation; following the 21 cycles 
the PCR was finished with a final elongation step of 7 
5 minutes] . After PCR amplification, the DNA fragment was 
cut with Xbal and BamHI and the 521 bp fragment ligated 
into the expression plasmid pCFM1156 cut with the same 
enzymes. PCR#5 utilized the outside primers (100 
pmoles/100 nl rxn) OLIGO#12 and 0LIGO#13 and 1 *il/100 |il 
10 rxn of a KGF template derived by ligation (by T4 ligase) 
of OLIGO #14 through OLIG0#19 (OLIGO#15 through OLIGO#18 
were phosphorylated with T4 polynucleotide kinase) 
using OLIGO#20 through OLIGO#24 as band-aid oligos 
(Jayaraman et al. (1992), Biotechniques, 12:392) for the 
15 ligation. The final construct was designated KGF 
(codon optimized) . 

All of the KGF analogs described herein are 
composed in part from DNA sequences found in KGF(dsd) or 
KGF (codon optimized), or a combination of the two. The 
20 sequences are further modified by the insertion into 
convenient restriction sites of DNA sequences that 
encode the particular KGF analog amino acids made 
utilizing one or more of the above -described techniques 
for DNA fragment synthesis. Any of the analogs can be 
25 generated in their entirety by the above described 
techniques. However, as a part of the general OLIGO 
design optimized E. coli codons were used where 
appropriate, although the presence of E. coli optimized 
codons in part or in to to of any of the genes where 
30 examined did not significantly increase the yield of 
protein that could be obtained from cultured bacterial 
cells. Figures 7 to 10 and 17 to 26 set forth by 
convenient xample particular KGF analog nucleotide and 
amino acid sequence constructions: R(144)Q (Figure 7) ; 
35 C(1,15)S/R(144)E (Figure 8) ; C(l, 15) S/R(144)Q 
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(Figure 9); AN23/R(144)Q (Figure 10); AN23/N(137)E 
(Figure 17); AN23/K (139) E (Figure 18); AN23/K(139)Q 
(Figure 19) ; AN23/R(144)A (Figure 20); AN23/R(144)L 
(Figure 21); AN23/K(147)E (Figure 22); AN23/K(147)Q 
5 (Figure 23); AN23/K(153)E (Figure 24); AN23/K(153)Q; 
(Figure 25) and AN23/Q(152)E/K{153) E (Figure 26). All 
the KGF analog constructions described herein were DNA 
sequence confirmed. 

10 EXAMPLE 2 : Production in E. coli 

Three different expression plasmids were 
utilized in the cloning of the KGF analog genes. They 
were pCFM1156 (ATCC# 69702), pCFM1656 (ATCC# 69576), and 

15 pCFM3102 (Figures 2A, 2B and 2C, respectively) . The 
plasmid p3102 can be derived from the plasmid pCFM1656 
by making a series of site-directed base changes with 
PGR overlapping oligo mutagenesis. Starting with the 
Bglll site (pCFM1656 plasmid bp # 180) immediately 5 1 to 

20 the plasmid replication promoter, PcopB, an ^ proceeding 
toward the plasmid replication genes, the base pair 
changes are as follows: 



WO 96/11951 



- 31 - 



PCT/OS95/13075 



P PFM1656 bp # bp in PCFM1656 bp changed to in PCFM31Q2 







U 
ff 


jZ\J1 


1 / n 


C/G 








A'iO 


A /T 

A/ 1 


\J / 






u 

ff 








c 

D 






CI 7 
OX / 




insert two G> 






# 


677 


G/C 


T/A 






# 


978 


T/A 


C/G 






# 


992 




A/T 






# 


1002 


A/T 


C/G 


10 




# 


1005 


C/G 


T/A 






# 


1026 


A/T 


T/A 






# 


1045 


C/G 


T/A 






# 


1176 


G/C 


T/A 






# 


1464 


G/C 


T/A 






# 


2026 


G/C 


bp deletion 






# 


2186 


C/G 


T/A 






# 


2479 


A/T 


T/A 




# 


2498-2501 


AGTG 


GTCA 


20 








TCAC 


CAGT 




# 


2641-2647 


TCCGAGC 


bp deletion 










AGGCTCG 




25 


# 


3441 


G/C 


A/T 




# 


3452 


G/C 


A/T 




# 


3649 


A/T 


T/A 




# 


4556 




insert bps 



30 (SEQ ID NO: 39) 5 ' -ffftCTTrArTAGTGTCGACCTGCAG-3 ' 

{SEQ ID NO: 40) 3 ' -CTCGAGTGATCACAGCTGGACGTC-5 1 



As seen above, pCrail56, pCFMl656 and pCFM3102 
are very similar to each other and contain many of the 

35 same restriction sites. The plasmids were chosen by 

convenience, and the vector DMA components can be easily 
exchanged for purposes of new constructs . The host used 
for all cloning was E. coli strain FM5 (ATCC: 53911) and 
the transformations were carried out (according to the 

40 method of Hanahan (1983), supra) or by electroelution 
with a Gene Pulser*" transfection apparatus (BioRad 
Laboratories, Inc., Hercules, CA) according to the 
manufacturer ' s instructions . 

Initially, a small, freshly cultured inoculum 

45 of the desired recombinant E. coli clone harboring the 
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desired construct on one of the three pCFM vectors was 
started by transferring 0,1 mL of a frozen glycerol 
stock of the appropriate strain into a 2 L flask 
containing 500 mL of Luria broth. The culture was 
5 shaken at 30 *C for 16 hours, after which the culture was 
transferred to a 15 L fermentor containing 8 L of 
sterile batch medium (Tsai, etal. (1987), J. Industrial 
Microbiol., 2:181-187). 

Feed batch fermentation starts with the 
10 feeding of Feed # 1 medium (Tsai, et a 1 . (1987), 

supra) . When the OD600 reached 35, expression of the 
desired KGF analog was induced by rapidly raising the 
culture temperature to 37 'C for two hours then up to 
42 *C to denature the CI repressor. The addition of Feed 
15 1 was discontinued in favor of Feed 2, the addition rate 
of which was initiated at 300 mL/hr. Feed 2 comprised 
175 g/L trypticase-peptone, 87.5 g/L yeast extract, and 
260 g/L glucose. After one hour at 42 'C, the culture 
temperature was decreased to 36 *C, where this 
20 temperature was then maintained for another 6 hours. 

The fermentation was then halted and the cells 
were harvested by centrifugation into plastic bags 
placed within 1 L centrifuge bottles. The cells were 
pelleted by centrifugation at 400 rpm for 60 minutes, 
25 after which the supernatants were removed and the cell 
paste frozen at -90 # C. 

Following expression of the various KGF 
analogs, in E. coli, native KGF, R{144)Q, 
C(1,15)S/R(144)E, C(1,15)S/R(144)Q and AN23/R(144)Q 
30 proteins were purified using the following procedure. 
Cell paste from a high cell density fermentation was 
suspended at 4 # C in 0.2 M NaCl, 20 mM NaP0 4 , pH 7.5 as a 
10-20% solution (w ight per volume) using a suitable 
high shear mixer. The suspended cells were then lysed 
35 by passing the solution through a homogenizer (AFV 

Gaulin, Inc., Everett, MA) three times. The outflowing 
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homogenate was cooled to 4-8 m C by using a suitable heat 
exchanger. Debris was then removed by centrifuging the 
lysate in a J-6B™ centrifuge (Beckman Instruments, Inc., 
Brea, CA) equipped with a JS 4.2 rotor at 4,200 rpm for 
5 30-60 min. at 4*C. Supernatants were then carefully 

decanted and loaded onto a previously prepared 450 mL (5 
cm x 23 cm) column of S-Sepharose Fast FloW™ resin 
(Pharmacia, Piscataway, NJ) equilibrated with 0.2 M 
NaCl, 20 mM NaP04, pH 7.5 at 4*C. Next, the column was 

10 washed with five column volumes (2250 mL) of 0.4 M NaCl, 
20 mM NaP04, pH 7.5 at 4'C. The desired protein was 
eluted by washing the column with 5 L of 0.5 M NaCl, 20 
mM NaP04/ pH 7.5. Then, 50 mL fractions were collected 
and the A28O of the effluent was continuously monitored. 

15 Fractions identified by A28O as containing eluted 

material were then analyzed by SDS-PAGE through 14% gels 
to confirm the presence of the desired polypeptide. 

Those fractions containing proteins of 
interest were then pooled, followed by the addition of 

20 an equal volume of distilled water. The diluted sample 
was then loaded onto a previously prepared 450 mL (5 cm 
x 23 cm) column of S-Sepharose Fast Flow equilibrated 
with 0.4 M NaCl, 20 mM NaP04, pH 6.8 at 4'C. The column 
was washed with 2250 mL of 0.4 M NaCl, 20 mM NaP04, pH 

25 6.8 and the protein eluted using a 20 column volume 

linear gradient ranging from 0.4 M NaCl, 20 mM NaPC^, pH 
6.8 to 0.6 M NaCl, 20 mM NaP04, pH 6.8. Again, 50 mL 
fractions were collected under constant A28O monitoring 
of the effluent. Those fractions containing the protein 

30 (determined by 14% SDS-PAGE) were then pooled, followed 
by concentration through a YM-10 membrane (10,000 
molecular weight cutoff) in a 350cc stirring cell 
(Amicon, Inc. Mayberry, MA) to a volume of 30-40 mL. 
The concentrate was then loaded onto a 

35 previously generated 1,300 mL (4.4 cm x 85 cm) column of 
Superdex-75™ r sin (Pharmacia) quilibrated in column 
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buffer comprising lx PBS (Dulbecco 1 s Phosphate Buffered 
Saline, "D-PBS", calcium and magnesium- free) or 0.15 M 
NaCl, 20 mM NaP0 4 , pH 7.0. After allowing the sample to 
run into the column, the protein was eluted from the gel 
filtration matrix using column buffer. Thereafter. 10 
mL fractions were recovered and those containing the 
analog (determined by 14% SDS-PAGE) were pooled. 
Typically, the protein concentration was about 5-10 
mg/mL in the resultant pool. All of the above 
procedures were performed at 4-8 *C, unless otherwise 
specified. 



Analysis 

15 Analysis was conducted on B. coli-derived, 

native KGP? R(144)Q; C(l, 15)S/R(144)E; C(l, 15) S/R(144)Q 
and AN23/R(144)Q. 



20 



Conf ormat iona 1 «t^^j 1 i f, Y 



The polypeptides were compared by their 
storage stability, thermal unfolding transition 
temperatures (T m ) , and stability in a broad range of pH 
conditions . 

25 The ability of native KGF, R(144)Q, 

C(1,15)S/R(144)Q, C(1,15)S/R(144)E and AN23/R(144)Q 
to prevent aggregation at elevated temperatures was also 
examined. Samples containing 0.5 mg/mL of protein were 
prepared in D-PBS . 0.5 mL of each sample was aliquoted 

30 into 3 cc type-1 glass vials. The vials were sealed 
with rubber stoppers and 13 mm flip-off aluminum seals 
were crimped on. These vials were then placed in a 37°c 
incubator. At predetermined time intervals, vials were 
withdrawn and analyzed for the loss of soluble protein. 

35 Visible precipitat s were removed by centrifuging 250 jiL 
of each sample through a 0.22 pm Spin-X filter unit 
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(Costar, Cambridge, MA) . Soluble protein in the 
filtered solutions was subsequently analyzed by size 
exclusion HPLC. The amount of soluble protein was 
determined by integrating the HPLC peak area and 
5 plotting the result as a function of incubation time at 
37°C. The results are shown in Figure 11. 

The half -lives for the loss of soluble, 
monomeric protein were then estimated from these kinetic 
curves. Table 1 shows the half -life for remaining 
10 soluble KGF upon storage at 37°C for these proteins. 

Table 1 

Half -life for the Loss of Soluble. Monomeric Proteins 



Protein 


tl/2 (day) 


native KGF 


0.6 


R(144)Q 


4.1 


C(1,15)S/R(144)Q 


13.3 


AN23/R(144)Q 


22.3 


C(1.15)S/R(144)E 


38.0 



15 

As seen in Table 1. above, and Figure 11, the 
native KGF aggregated the most rapidly, with a half -life 
of 0.6 days. R(144)Q increased the half -life to 4.1 
days. C(1,15)S/R(144)Q, AN23/R(144)Q and 
20 C(l, 15)S/R(144)E showed substantial increases in the 
solubility half -life to 13.3, 22.3 and 38 days, 
r espec t ive ly . 

Thermal Unfolding 

25 

Thermal unfolding was monitored by circular 
dichroism (CD) at 230 run using a J-720™ 
spectropolarimeter (Jasco, Inc., Easton, MD) equipped 
with a PTC-343 Peltier-type temp rature control system. 
30 For CD analysis, separate samples containing 0.1 mg/mL 
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of the polypeptide to be analyzed were prepared in D-PBS 
(Life Technologies, Inc., Grand Island, NY). For each 
sample, about 2.5 mL was loaded into a 10 mm path length 
rectangular Suprasil™ quartz (Heraeus Quarzschmelze, 
5 GmbH, Hanau, Germany) fluorescent cell (Hellma Cells, 
Inc., Jamaica, NY). The cell was then placed into the 
Peltier-type temperature control system in the 
spectropolarimeter. Thermal unfolding was carried out 
at a rate of 50°C/hr. Changes in ellipticity were 

10 monitored at 230 nm to indicate unfolding. The T m of 
each sample was estimated by identifying a temperature 
at which 50% of protein molecules in the solution were 
unfolded (Biophysical Chemistry, Cantor and Schimmel 
(eds), W.H. Freeman and Co. San Francisco (1980)). The 

15 estimated T m for each of the three proteins is listed in 
Table 2. 

Table 2 

Estimated Melt-inn Tempgratnrgp 

20 



Protein 


Tm (°C) 


native KGF 


54.0 


R(144)Q 


61.5 


C(1,15)S/R(144)Q 


62.5 


dN23/R(144)Q 


63.0 


C(1.15)S/R(144)E 


63.5 



As the results show, R(144)Q has a greater than 7°C 
increase in the Tm as compared with native KGF. The 
substitution of R(144)Q to C(l, 15)S/R(144)Q or AN23 adds 
at least another 1°C increase in T m and more than 8°C as 
compared with native KGF. Moreover, the 
C(1,15)S/R(144)E is greater than 9°C more stable than 
native KGF . Therefore, switching a positiv ly charged 
residue (Arg) at amino acid position 144 to a neutrally 
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or negatively charged residue substantially stabilized 
the polypeptide. 

EH 

5 

The acid stabilities of C(1,15)S/R(144)Q and 
C(l, 15)S/R(144)E were also compared to that of native 
KGF, by adjusting D-PBS to different pH values by adding 
concentrated HCl or NaOH. Approximately 2.35 mL of 

10 D-PBS at different pH values was mixed with 100 of 

2.45 mg/mL KGF protein in a quartz cell. These samples 
were thermally unfolded at a rate of 50°C/hr and 
monitored by CD at 230 run. Figure 12 shows the T m as a 
function of pH for native KGF, C(l, 15)S/R(144)Q and 

15 C(1,15)S/R(144)E. In the pH range tested, the 

C(1,15)S/R(144)Q and C (1, 15) S/R(144)E always have a 
higher T m than the native KGF. 

In vitrv PiPlpgicca Activity 

20 

In vitro mitogenic activity of R(144)Q, 
AN23/R(144)Q, C<1, 15) S/R(144)Q and C (1, 15)S/R(144)E was 
also determined as a function of protein concentration 
and the half -maximal concentrations by measurement of 

25 [ 3 H] -thymidine uptake by Balb/MK cells (according to the 
methods of Rubin et al. (1989), supra). 

Generally, the concentrations of each of the 
KGF analogs relative to a known standard native KGF was 
determined using an in vitro biological assay. Each KGF 

30 analog was then diluted and assayed for biological 

activity using a Balb/MK mitogenic assay. The samples 
were first diluted in a bioassay medium consisting of 
50% customer-made Eagle 1 s MEM, 50% customer-made F12, 
5 jiG/mL transferrin, 5 ng/ml sodium selenite, 0.0005% 

35 HSA and 0.005% Tween 20. KGF samples were then added 
into Falcon primer ia 96-well plates seeded with Balb/MK 



WO 96/11951 



- 38 - 



PCI7US95/13075 



cells. Incorporation of [ 3 H] -Thymidine during DNA 
synthesis was measured and converted to input native KGF 
concentration by comparison to a native KGF standard 
curve. The results are presented in Figures 13 to 16. 

5 As seen in Figures 13 to 16, each of the KGF analogs has 
mitogenic activity. 

While the present invention has been described 
above both generally and in terms of preferred 
embodiments, it is understood that other variations and 

0 modifications will occur to those skilled in the art in 
light of the description above. 
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WHAT IS CLAIMED IS: 

1. A polypeptide analog of native KGF comprising 
a charge -change by the deletion or substitution of one 

5 or more of amino acid residues 41-154 of Figure 2 (amino 
acids 72-185 of SEQ ID N0:2). 

2 . The polypeptide analog according to Claim 1 
wherein the deleted or substituted amino acids are 

10 selected from the group consisting of Arg 41 , Gin 43 , 
Lys 55 , Lys* 5 , Ly S 128 # asx^ 7 . Gin 138 , Lys 13 *, Arg* 44 , 
Lys* 47 , Gin 152 , Lys 153 and Thr 154 . 

3. The polypeptide analog according to Claim 1 
15 selected from the group consisting of R(144)Q, 

C{1,15)S/R(144)Q, C{1,15)S/R(144)E and AN23/R (144) Q. 

4. A pharmaceutical formulation comprising a 
therapeutically effective amount of a polypeptide analog 

20 of KGF according to Claim 1 and a pharmaceutical^ 
acceptable carrier* 

5. A pharmaceutical formulation comprising a 
therapeutically effective amount of a lyophilized 

25 polypeptide analog of KGF according to Claim 1. 

6. The pharmaceutical formulation of Claim 4 
further comprising a pharmaceutically acceptable 
carrier. 

30 

7. A nucleic acid molecule selected from the 
group consisting of DMA and RNA wherein the nucleic acid 
mol cule encodes a polypeptide analog of native KGF 
comprising a charge-change by the deletion or 

35 substitution of one or mor of amino acid residues 41- 
154 of Figure 2 (amino acids 72-185 of SEQ ID N0:2). 



WO 96/11951 



- 40 - 



PCT/US9S/13075 



10 



8. The nucleic acid molecule according to Claim 7 
wherein the deleted or substituted amino acids are 
selected from the group consisting of Arg 41 , Gin 43 , 

5 Lys55, Lys*5, L y S 128 § Asn 1 ", Gln"8 # Lys 139 r Arg* 44 , 
Lys 147 , Gin 152 , Ly S l53 and Thr 15 4 . 

9. The nucleic acid molecule according to Claim 7 
wherein the polypeptide analog is selected from the 
group consisting of R(144)Q, C (1, 15) S/R (144) Q, 

C(l, 15)S/R(144)E and AN23/R(144)Q. 

10. A biologically functional plasmid or viral 
vector comprising a nucleic acid molecule according to 

15 Claim 7. 

11. A procaryotic or eucaryotic host cell stably 
transfected or transformed with a biologically 
functional vector according to Claim 8. 

12. A procaryotic host cell according to Claim 11 
that is E. coli. 

13. A eucaryotic host cell according to Claim 11 
25 that is a mammalian cell. 

14. A eucaryotic host cell according to Claim 12 
that is a Chinese hamster ovary cell. 

30 15. A process for the production of a polypeptide 

analog of KGF, the process comprising growing under 
suitable nutrient conditions a procaryotic or eucaryotic 
host cell stably transformed with a nucleic acid 
molecul according to Claim 7, in a manner allowing 

35 expression of the encoded polypeptide analog, and 
isolating the polypeptide analog so produced. 



20 
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16. A method of stimulating the production of 
non- fibroblast epithelial cells comprising contacting 
such cells with an effective amount of a polypeptide 
5 analog of a KGF according to Claim 1 . 
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Figur* 1 

human KGF (+ signal sequenc ) 

I OLIGOil I 

5 * CAATGACCTAGGAGTAACAATGAAC 3' 

5 ' CAATCTACAATTCACAGATAGGAAGAGGTCAATGACCTAGGAGTAACAATCAACTCAAGA- 
+ + + + + + g0 

-TTCATTTTCATTATGTTATTCATGAACACCCGGAGCACTACACTATAATGCACAAATGGA- 
+ + + + + + 12Q 

M H K W I 

-TACTGACATGGATCCTGCCAACTTTGCTCTACAGATCATGCTTTCACATTATCTGTCTAG- 

+ + + + + + 180 

LTWILPTLLYRSCFHI ICLV 

-TGGGTACTATATCTTTAGCTTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGTGA- 
+ + + + ♦ + 240 

GTISLACNDMTPEQMATNVN 

-ACTGTTCCAGCCCTGAGCGACACACAAGAAGTTATGATTACATGGAAGGAGGGGATATAA- 

~~~ ~ + + + + + + 300 

CSSPE RHTRSYDYMEGGDIR 

-GAGTGAGAAGACTCTTCTGTCGAACACAGTGGTACCTGAGGATCGATAAAAGAGGCAAAG- 

"7,"Z * * + + ~ + + 360 

VRRLFCRTQWYLRIDKRGKV 

-TAAAAGGGACCCAAGAGATGAAGAATAATTACAATATCATGGAAATCAGGACAGTGGCAG- 
_ + + + + + + 42Q 

KGTQEMKNNYNIMEIRTVAV 

-TTGGAATTGTGGCAATCAAAGGGGTGGAAAGTGAATTCTATCTTGCAATGAACAAGGAAG- 

~ + + + + ♦ + 480 

GIVAIKGVESEFYLAMNKEG 

-GAAAACTCTATGCAAAGAAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATTCTGG- 
+ + + + + + 54Q 

KLYAKKECNEDCNFKELILE 

-AAAACCATTACAACACATATGCATCAGCTAAATGGACACACAACGGAGGGGAAATGTTTG- 
+_ + + + + + 60Q 

NHYNTYASAKWTHNGGEMFV 

-TTGCCTTAAATCAAAAGGGGATTCCTGTAAGAGGAAAAAAAACGAAGAAAGAACAAAAAA-" 
+ + ♦ + + + ggo 

ALNQKGIPVRGKKTKKEQKT 

-CAGCCCACTTTCTTCCTATGGCAATAACTTAATTGCATATGGTATATAAAGAACCCAGTT 
+ + ♦ + + + 720 

AHFLPMAIT* 
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-CCAGCAGGGAGATTTCTTTAAGTGGACTGTTTTCTTTCTTCTCAAAATTTTCTTTCCTTT 
+ + + + + + 780 

-TATTTTTTAGTAATCAAGAAAGGCTGGAAAAACTACTGAAAAACTGATCAAGCTGGACTT 
+ + + + + + 84Q 

3 * ACCTGAA- 

| 

-GTGCATTTATGTTTGTTTTAAG 3 • 
+ 862 

-CACGTAAATACAAACAAAA 5' 
OLIGOI2 1 
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Figure 3 

RSH-KGF 

plasmid ONA Clal Xbal Ndel 

sequence 5 • - ATC GAT T TGAT TC T AGAAGG AGGAATAACAT ATGAAAAAG - 

M K K 

RSH signal sequence Mlul 
-CGCGCACGTGCTATCGCCATTGCTGTGGCTCTGGCAGGTTTCGCAACTAGTGCACA- 3 • 
RARAIAIAVALAGFATSAHA- 

Mlul 

5 ' CGCGTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGTGAACTGTTCCAGCCCTGA- 
+ + + + + + 60 

-CNDMTPEQMATNVNCSSPE 

-GCGACACACAAGAAGTTATGATTACATGGAAGGAGGGGATATAAGAGTGAGAAGACTCTT- 
+ + + + + + 12Q 

RHTRSYDYMEGGD IRVRRLF 
Kpnl Clal 

-CTGTCGAACACAGTGGTACCTGAGGATCGATAAAAGAGGCAAAGTAAAAGGGACCCAAGA- 
+ + + + + _ + 18Q 

CRTQWYLRIDKRGKVKGTQE 
-GATGAAGAATAATTACAATATCATGGAAATCAG<jACAGTG<MAGTTGGAATTGTGGCAAT- 

+ + + + + + 240 

MKNNYNIME IRTVAVGIVAI 
ECORI 

-CAAAGGGGTGGAAAGTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAA- 
+ + + + + 30Q 

KGVESEFYLAMNKEGKLYAK 
BsmI 

-GAAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATTCTGGAAAACCATTACAACAC- 
+ + + + + + 36Q 

KECNEDCNFKELI LENHYNT 
Ndel 

-ATATGCATCAGCTAAATGGACACACAACGGAGGGGAAATGTTTGTTGCCTTAAATCAAAA- 

♦ ♦ + ♦ + ♦ 420 

YASA KWTHNGGEMFVALNQK 

HKGGATTCCTGTAAGAGGAAAAAAAACGAAGAAAGAACAAAAAACAGCCCACTTTCTTCC- 
+— — + + + + + 480 

G I PVRGKKTKKEQ KTAH FLP 

BamHI 

-TATGGCAATAACTTAATAG 3' -plasmid DNA 

— — -+ — 503 -s qu nc 

M A I T * 
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Figura 4 

KGF 

Ndel 

5 ' TATGTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGTGAACTGTTCCAGCCCTGA- 
+ + + ♦ + + 60 

MCNDMTPEQMATNVNCSSPE 

-GCGACACACAAGAAGTTATGATTACATGGAAGGAGGGGATATAAGAGTGAGAAGACTCTT- 
+ + + + + + 12 q 

RHTRSYDYMEGGDIRVRRLF 
Kpnl Clal 

-CTGTCGAACACAGTGGTACCTGAGGATCGATAAAAGAGGCAAAGTAAAAGGGACCCAAGA- 
+ + + + + + 180 

CRTQWYLRI DKRGKVKGTQE 

-GATGAAGAATAATTACAATATCATGGAAATCAGGACAGTGGCAGTTGGAATTGTGGCAAT- 
+ + + + + + 240 

MKNNYNIMEIRTVAVGIVAI 
EcoRI 

-CAAAGGGGTGGAAAGTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAA- 
+ + + + + + 3Q-0 

KGVESEFYLAMNKEGKLYAK 
BsmZ 

-GAAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATTCTGGAAAACCATTACAACAC- 
+ + +_ + + + 360 

KECNEDCNFKELILENHYNT 
Ndel 

-ATATGCATCAGCTAAATGGACACACAACGGAGGGGAAATGTTTGTTGCCTTAAATCAAAA- 

+ + + + + + 420 

YASAKWTHNGGEMFVALNQK 

-GGGGATTCCTGTAAGAGGAAAAAAAACGAAGAAAGAACAAAAAACAGCCCACTTTCTTCC- 

+ + + + + + 480 

GIPVRGKKTKKEQKTAHFLP 

BamHl 

-TATGGCAATAAC7TAATAG 3* 
+ + 503 

M A I T * 
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Figur* 5 

subsitution of Kpnl to EcoRI sequenc to make KGF(dsd) 
Kpnl 

I 0LIG0#6 1 | 0LIG0#7 

5 1 CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAACAACTACAAT- 
3 ' CATGGACGCATAGCTGTTTGCGCCGTTTCAGTTCCCGTGGGTTCTCTACTTTTTGTTGATGTTA- 

I 0LIG0I9 1 I — OLIGOI10 

-YLRIDKRGKVKGTQEMKNNYN- 

ECORI 

1 | OLIGOI8 1 

5 ' -ATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG 3 
3 ' -TAATACCTTTAGGCATGACAACGACAACCATAGCAACGTTAGTTTCCACAACTTAGACTTAA 5 

I , OLIGO#11 1 

IME IRTVAVGIVAIKGVESEF- 
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FIGURE 6 

KGF (codon optimized) 

Xbal 

I OLIGO#12 1 

5 'AGTTTTGATCTAGAAGGAGG 3' 

I 0LIG0#14 

5 • AGTTTTGATCTAGAAGGAGGAATAACATATGTGCAACGACATGACTCCGGAACAGATGGCT- 

I | OLIGOI15 

-ACCAACGTTAACTGCTCCAGCCCGGAACGTCACACCCGTAGCTACGACTACATGGAAGGTG- 

3» GGGCCTTGCAGTGTGGGCAT 5' 
I OLIGO#20 1 

0LIG0#15 1 i_ 

-GTGACATCCGTGTTCGTCGTCTGTTCTGCCGTACCCAGTGGTACCTGCGTATCGACAAACG- 

3' ATAGCTGTTTGC- 
|-OLIGO#21~ 

OLIGO#16 

-TGGTAAAGTTAAAGGTACCCAGGAAATGAAAAACAACTACAACATCATGGAAATCCGTACT- 
-ACCATTTC 5* 



I | OLIGO#17 

-GTTGCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTGAATTCTACCTGGCAATGAACA- 
3' ACGTTAGTTTCCACAACTTA 5' 
I OLIGO#22 1 

OLIGO#17 1 | 

-AAGAAGGTAAACTGTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAACTGAT- 

3 ' GAAGTTTCTTGACTA- 
I OLIGO#23 

OLIGO#18 

-CCTGGAAAACCACTACAACACCTACGCATCTGCTAAATGGACCCACAACGGTGGTGAAATG- 
-GGACC 5* 



1 | 0LIGO#19 

-TTCGTTGCTCTGAACC»GAAAGGTATCCCGGTTCGTGGTAAAAAAACCAAAAAAGAACAGA- 
3« GGTCTTTCCATAGGGCCAAG 5' 
I OLIGO#24 1 

OLIGO#19 1 

-AAACCGCTCACTTCCTGCCGATGGCAATCACTTAATAGGATCCAGTTTTGA 3 1 

3' AATTATCCTAGGTCAAAACT 5' 

I -OLIGO#13 1 

BamHI 
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rigur* 7 

KGF R(144)Q 

5 • ATGTGCAATGATATGACTCCTGAACAAATGGCTACCAATGTCAACTGTTCCTCTCCGGAG- 
+ + + + + + 6Q 

MCNDMTPEQMATNVNCSSPE 

-CGCCACACCCGGAGTTACGATTACATGGAAGGTGGGGATATTCGCGTACGTCGTCTGTTC- 
— + + + + + + 12 o 

RHTRSYDYMEGGDI RVRRLF 

-TGCCGTACCCAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAG- 

+ + + + + + 18 o 

CRTQWYLRIDKRGKVKGTQE 

-ATGAAAAACAACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATC- 

+ + + + + + 240 

MKNNYNIME IRTVAVGIVAI 

-AAAGGTGTTGAATCTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAAG- 

+ + + + + + 300 

KGVESEFYLAMNKEGKLYAK 

-AAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATTCTGGAAAACCATTACAACACA- 

+ + + + + + 36O 

KECNEDCNFKELILENHYNT 

-TATGCATCTGCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAA- 
- + ~ + + + — + 420 

YASAKWTHNGGEMFVALNQK 

-GGTATCCCTGTTCAAGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCG- 

+ + + + + + 48o 

GIPVQGKKTKKEQKTAHFLP 

-ATGGCAATCACTTAA 3' 
+ 495 

M A I T * 
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TLgvr 8 

KGF C(1,15)S/R(144)E 
5 • ATGTCTAATGATATGACTCCGGAACAGATGGCTACCAACGTTAACTCCTCCTCCCCGGAA- 

MSNDMTPEQMATNVNS ~S + ~S~~P~~E~ + 6 ° 
-CGTCACACGCGTTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTC- 

RHTRSYDYMEGG DIRV ~R + ~R~"l~~F~ + 
-TGCCGTACCCAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAG- 

crtqwylridkrgk v~~k~~g + "t"~q~"e~ + 180 
-atgaaaaacaactacaatattatggaaatccgtactgttgctgttggtatcgttgcaatc- 

MKNNYNIMEIRT v""a"~v"~G~I + "v~~A~~I~ + 
-AAAGGTGTTGAATCTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAAG- 

kgvesefylamnkegkl + ya ~k~ + 300 

KECNEDCNFKELI l"~e""n""h + T"n""t" + 

-tatg^tctgctaaatggacccacaacggtggtgaaatgttcgttgctctgaaccagaaa- 
yasakwthnggemfval + n "q~~k~* 420 

"f?I™ff TGTTGA ^ T ^ GAA ^ CAAG '^^ 

gipvegkktk + k""e"~q""k~"t""a""h + "f"~l~"p~ + 480 

-atggcaatcacttaa 3» 
+ 495 

M A I T * 
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Figur* 9 

KGF C(1,15)S/R144Q 

5 ' ATGTCTAATGATATGACTCCGGAACAGATGGCTACCAACGTTAACTCCTCCTCCCCGGAA- 
+ + _ + + + + g 0 

MSNDMTPEQMATNVNSSSPE 

-CGTCACACGCGTTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTC- 
+ + + + + + 12Q 

RHTRSYDYMEGGDIRVRRLF 

-TGCCGTACCCAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAG- 
+ + + + + + 18Q 

CRTQWYLRI DKRGKVKGTQE 

-ATGAAAAACAACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATC- 
+ + + + + + 24Q 

MKNNYNIME IRTVAVGI VAI 

-AAAGGTGTTGAATCTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAAG- 
+ + + + + + 3Q0 

KGVESEFYLAMNKEGKLYAK 

-AAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATTCTGGAAAACCATTACAACACA- 
+ + + + + + 360 

KECNEDCNFKELILENHYNT 

-TATGCATCTGCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAA- 
+ + + + + + 42Q 

YASAKWTHNGGEMFVALNQK 

-GGTATCCCTGTTCAAGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCG- 
+ + + ^ + + 48Q 

GIPVQGKKTKKEQKTAHFLP 

-ATGGCAATCACTTAA 3' 
+ 

M A I T * 
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Figure 10 

KGF AN23/R(144)Q 

5 ' ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 
+ + + + + + g0 

MSYDYMEGGDIRVRRLFCRT 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC- 
+ + + + + + 12Q 

QWYLRI DKRGKVKGTQEMKN 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 
+ + + + + + 18Q 

NYNIMEIRTVAVGIVAIKGV 

-GAATCTGAATTCTATCTTGCAATGAACAAGGAAGGAAAACTCTATGCAAAGAAAGAATGC- 
+ + + + + + 24Q 

ESEFYLAMNKEGKLYAKKEC- 

-AATGAAGATTGTAACTTCAAAGAACTAATTCTGGAAAACCATTACAACACATATGCATCT- 
+ + + + + + 3Q0 

NEDCNFKELILENHYNTYAS 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 
+ + + + + + 3g0 

AK WTHNGGEMFVALNQKG I P 

-GTTCAAGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCGATGGCAATC- 
+ + + + + + 42Q 

VQ6KKTKKE. QKTAHFLPHAZ 

-ACTTAA 3* 

426 

T * 
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Figure 11 
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Figure 12 
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Figure 13 



wt vs. R144Q 
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Figure 16 
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Figure 17 

AN23/N(137)E 

5 1 -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT- 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC- 
+ + + + + + 

QWYLRIDKRGKVKGTQEMKN- 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 
+ + + + + + 

NYNIMEI RTVAVGIVAIKGV- 

-GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGCAAAAAAAGAATGC- 
+ + + + + + 

ESEFYLAMNKEGKLYAKKEC- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 
+ + + + 

NEDCNFKELILENHYN T~ y""a""s""- 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGGAACAGAAAGGTATCCCT- 
* + + , 

AKWTHNGGEMFVALEQK + g"i~~p"~- 
- GTTCGTGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCG ATGGCAATC - 

VRGKKTKKEQKTAH + FLP V"a""i""- 

-ACTTAA-3 1 

+ 

T * 
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Figure 18 

AN23/K(139)E 

5 1 -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 

+ + + 4- + + 

MSYDYMEGGDIRVRRLFCRT- 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC 

+ + + + + + 

QWYLRIDKRGKVKGTQEM KN- 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 

+ + + + + + 

NYNIMEIRTVAVGIVAIKGV- 

-GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGCAAAAAAAGAATGC- 

+ + + + + + 

ESEFY-LAMNKEGKLYAKKEC 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT 

+ + + + + + 

NEDCNFKELILENHYNTYAS 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGGAAGGTATCCCT- 

+ + + + + + 

AKWTHNGGEMFVALNQEGI P- 

-GTTCGTGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCGATGGCAATC- 

+ + + + + + 

VRGKKTKKEQKTAHFLPMAI - 

-ACTTAA-3 * 
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Figure 19 

AN23/K(139)Q 

5 ' -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC 

+ + + + + + 

MSYDYMEGGDIRVRRLFCRT 

- C AGTGGTACC TGCGTATCGAC AAACGCGGCAAAGTCAAGGGC ACCC AAGAGATGAAAAAC 
QWYLRIDKRGKVKGTQEMKN 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 

+ + + + + + 

NYNIMEIRTVAVGIVAIKGV 

-GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGCAAAAAAAGAATGC 

ESEFYLAMNKEGKLYAKKEC 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT 

+ + + + + + 

NEDCNFKELILENHYNTYAS- 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGCAAGGTATCCCT- 

+ + + + 4 + 

AKWTHNGGEMFVALNQQGI P ■ 

- GT TCG TGGTAAGAAAAC C AAGAAAGAAC AGAAAACCGCTCAC TTC CTGCCGATGGC AATC ■ 

+ + + + + + 

VRGKKTKKEQKTA HFLPMAI 

-ACTTAA-3 1 
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Figure 20 

AN23/R(144)A 

5 ' - ATGTCCTACGACTAC ATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC - 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT- 

- C AGTGGTACCTGCGTATCGAC AAACGCGGCAAAGTC AAGGGCACCC AAGAGATG AAAAAC - 
+ + + + + + 

QWYLRIDKRGKVKGTQEMKN- 
-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 

NYNIMEIRTVAVGIVAI + KGV- 
- G AATCTGAATTCTACCTGGCAATGAAC AAAGAAGGTAAACTGTACGC AAAAAAAGAATGC - 

ESEFYLAMNKEGKLYAK + KEC- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 
+ + + + + + 

NEDCNFKELILENHYNTYAS- 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 
+ + + + + + 

AKWTHNGGEMFVALNQKGI P- 

-GTTGCTGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCGATGGCAATC- 
+ + + + + + 

VAGKKTKKEQKTAHFLPMAI- 

-ACTTAA-3 ' 

+ 

T * 
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Figure 21 

AN23/R(144)L 

5 ' -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC 
+ + + + + + 

MSYDYMEGGDI RVRRLFCRT 
-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC 



QWYLRI DKRGKVKGTQEMKN 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT 
+ + + + + + 

NYNIMEIRTVAVGIVAI KGV 

-GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGCAAAAAAAGAATGC- 

+ + + + + + 

ESEFYLAMNKEGKLYAKKEC- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 

+ + + + + + 

NEDCNFKELILENHYNTYAS - 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 

+ + + + + : -+ 

AKWTHNGGEMFVALNQKGI P - 

* GTTCTGGGTAAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCGATGGCAATC - 
+ + + + + + 

VLGKKTKKEQKTAHFLPMAI - 
-ACTTAA-3 ' 



T * 
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Figure 22 

AN23/K(147)E 

5 1 -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC 

+ + + + + + 

QWYLRIDKRGKVKGTQEMKN- 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 
+ + + + + + 

NYNIMEIRTVAVGIVAIKGV- 

- GAATCTGAATTCTACCTGGC AATGAAC AAAGAAGGTAAACTGTACGCAAAAAAAGAATGC - 
+ + + + + + 

ESEFYLAMNKEGKLYAKKEC- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 
+ + + + + + 

NEDCNFK ELILEN HYNTYAS- 

- G C TAAATGG ACCC ACAACGGTGGTGAAATGTTCGTTGCTCTGAAC CAGAAAGGTATTCCT - 
+ + + + + + 

AK'WTHNGGEMFVALNQKGI P - 

-GTTCGTGGTAAGGAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCCGATGGCAATC- 

+ + + + + + 

VRGKETKKEQKTAHFLPMAI 

-ACTTAA-3 ' 



25/29 



WO 96/11951 



PCT/US95/13075 



Figure 23 

AN23/K(147)Q 

5 ' -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT- 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC- 
+ + + + + + 

QWYLRIDKRGKVKGTQEMKN- 

-aactacaatattatggaaatccgtactgttgctgttggtatcgttgcaatx:aaaggtgtt- 
+ + + + + + 

nynimeirtvavgivaikgv- 

-gaatctgaattctacctggcaatgaacaaagaaggtaaactgtacgcaaaaaaagaatgc- 
+ + + + + + 

ESEFY. LAMNKEGKLYAKKEC- 

NEDCNFK + ELI T"e"n"h + "y""n" W"a"s"- 
-GCTAAATGGACCCAC^ 

akwthn G + ge h"T77T777T77"- 

VRG KQTK + KE Q~TTTT~F~"7"p~ + m"a~T~- 
-ACTTAA-3 ' 
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Figure 24 

AN23/K(153)E 

5 * -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC 
+ + + + + + 

Q.WYLRIDKRGKVKGTQEMKN- 

- AACTAC AATATTATGGAAATCCGTAC TGTTGCTGTTGGTATCGTTGC AATC AAAGGTGTT • 
+ + + + + + 

NYNIMEIRTVAVGIVAIKGV- 

- GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGC AAAAAAAGAATGC ■ 
+ + + + + + 

ESEFYLAMNKEGKLYAKKEC- 

•AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 

+ + + + + + 

NEDCNFKELILENHYNTYAS 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 

+ + + + + + 

AKWTHNGGEMFVALNQ KG I P - 

- GTTCGTGGTAAGAAAACCAAGAAAGAACAGGAAACCGCTC ACTTCCTGCCGATGGC AATC ■ 
+ + + + + + 

VRGKKTKKEQETAHFLPMAI - 

-ACTTAA-3 ' 
+ 
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Figure 25 

AN23/K(153)Q 

5 * -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 
+ + + + + + 

MSYDYMEGGDIRVRRLFCRT- 

- CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC - 
+ + + + + ; + 

QWYLRIDKRGKVKGTQEMKN- 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 
+ + + + + + 

NYNIMEIRTVAVGIVAIKGV- 

- GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGC AAAAAAAGAATGC - 
+ + + + + + 

ESEFYLAMNK EGKLYAKKEC- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 

+ + + + + + 

NEDCNFKELILENHYNTYAS 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 
+ + + + + + 

AKWTHNGGEMFVALNQKGI P - 

- GTTCGTGGTAAGAAAACCAAGAAAGAAC AGC AAACCGCTC ACTTCCTGCCGATGGC AATC - 



+ - 



VRGKKTKKEQQTAHFL PMAI 



-ACTTAA-3 • 
+ 

T * - 
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Figure 26 

AN23/Q(152)E/K(153)E 

5 1 -ATGTCCTACGACTACATGGAAGGTGGTGACATCCGCGTACGTCGTCTGTTCTGCCGTACC- 
+ +. + + + 

MSYDYMEGGDIRVRRLFCRT- 

-CAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCCAAGAGATGAAAAAC- 
+ + + + + + 

QWYLRIDKRGKVKGTQEMKN- 

-AACTACAATATTATGGAAATCCGTACTGTTGCTGTTGGTATCGTTGCAATCAAAGGTGTT- 
+ ♦ + + + + 

NYNIMEIRTVAVGIVAIKGV- 

-GAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAACTGTACGCAAAAAAAGAATGC- 

ESEFYLAMNKEGKL + YA k" 4 k"e""c""- 

-AACGAAGACTGCAACTTCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT- 
+ + + + + + 

nedcnfkelilenhyntyas- 

-GCTAAATGGACCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGTATCCCT- 
+ ^ + + + + 

AKWTHNGGEMFVALNQKGI p- 
- GTTCGTGGTAAGAAAACCAAGAAAGAAGAGGAAACCGCTC ACTTC CTGC CG ATGGC AATC - 

vrgkktkkeee "t""a""h + "f"l""p" + m""a""i""- 

-ACTTAA-3 ' 
+ 

T . * 
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